Multi-label large margin hierarchical perceptron
نویسندگان
چکیده
This paper looks into classification of documents that have hierarchical labels and are not restricted to a single label. Previous work in hierarchical classification focuses on the hierarchical perceptron (Hieron) algorithm. Hieron only supports single label learning. We investigate applying several standard multi-label learning techniques to Hieron. We then propose an extension of the algorithm (MultiHieron) that significantly outperforms all previously mentioned techniques. MultiHieron has a new aggregate loss function for multiple labels. Improvement is shown on the Aviation Safety Reporting System (ASRS) flight anomaly database and OntoNews corpus using both at and hierarchical categorisation metrics.
منابع مشابه
Hierarchical Multi-Label Text Categorization with Global Margin Maximization
Text categorization is a crucial and wellproven method for organizing the collection of large scale documents. In this paper, we propose a hierarchical multi-class text categorization method with global margin maximization. We not only maximize the margins among leaf categories, but also maximize the margins among their ancestors. Experiments show that the performance of our algorithm is compet...
متن کاملHierarchical multi-label classification using local neural networks
Hierarchical Multi-Label Classification is a complex classification task where the classes involved in the problem are hierarchically structured and each example may simultaneously belong to more than one class in each hierarchical level. In this paper, we extend our previous works, where we investigated a new local-based classification method that incrementally trains a multilayer perceptron f...
متن کاملExtension of TSVM to Multi-Class and Hierarchical Text Classification Problems With General Losses
Transductive SVM (TSVM) is a well known semi-supervised large margin learning method for binary text classification. In this paper we extend this method to multi-class and hierarchical classification problems. We point out that the determination of labels of unlabeled examples with fixed classifier weights is a linear programming problem. We devise an efficient technique for solving it. The met...
متن کاملSurrogate Functions for Maximizing Precision at the Top
The problem of maximizing precision at top, also dubbed Precision@k, finds relevance in myriad learning applications such as ranking, multi-label classification, and learning with severe label imbalances. Despite its popularity, Precision@k is not known to have a surrogate function that upper bounds it. Similarly, notions of consistency under certain noise/margin conditions are also not explore...
متن کاملSemi-supervised Multi-label Classification - A Simultaneous Large-Margin, Subspace Learning Approach
Labeled data is often sparse in common learning scenarios, either because it is too time consuming or too expensive to obtain, while unlabeled data is almost always plentiful. This asymmetry is exacerbated in multi-label learning, where the labeling process is more complex than in the single label case. Although it is important to consider semisupervised methods for multi-label learning, as it ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJDMMM
دوره 1 شماره
صفحات -
تاریخ انتشار 2008